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SUMMARY 

The clinical application of CCR5 antagonists involves first determining the coreceptor usage by the infecting viral strain. 
Bioinformatics programs that predict coreceptor usage could provide an alternative method to screen candidates for treatment with 
CCR5 antagonists, particularly in countries with limited financial resources. Thus, the present study aims to identify the best approach 
using bioinformatics tools for determining HIV-1 coreceptor usage in clinical practice. Proviral DNA sequences and Trofile results 
from 99 HIV-1 -infected subjects under clinical monitoring were analyzed in this study. Based on the Trofile results, the viral variants 
present were 81.1% R5, 21.4% R5X4 and 1.8% X4. Determination of tropism using a Geno2pheno, . analysis with a false 
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positive rate of 10% gave the most suitable performance in this sampling: the R5 and X4 strains were found at frequencies of 78.5% 
and 28.4%, respectively, and there was 78.6% concordance between the phenotypic and genotypic results. Further studies are needed 
to clarify how genetic diversity amongst virus strains affects bioinformatics-driven approaches for determining tropism. Although this 
strategy could be useful for screening patients in developing countries, some limitations remain that restrict the wider application of 
coreceptor usage tests in clinical practice. 
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INTRODUCTION 

Maraviroc, which blocks virus entry by binding to the CCR5 
coreceptor, was the first CCR5 antagonist approved for clinical use. This 
molecule has been widely recommended for controlling HIV infection 
with a low level of drug resistance 614 " 8 . However, the use of this drug 
class in clinical practice requires the assessment of the coreceptor usage 
before and during the treatment 20 . Studies comparing the efficiency of 
phenotypic and genotypic assays, in order to develop widely applicable 
strategies useful in developing countries, have raised several issues 
regarding the determination of HIV-1 tropism 7 IS . 

Although phenotypic tropism tests (PTT) are considered the gold 
standard for determining coreceptor usage, their wide application in 
clinical practice is limited mainly by their high cost and the logistical 
restrictions of the procedures 1 - 7 . In this context, programs able to 
predict coreceptor usage based on V3 sequences have provided a 
suitable alternative for screening candidates for CCR5 antagonist 
therapy 23 . 

Several bioinformatics protocols for tropism prediction have been 
published. They assess tropism based on amino acids sequences from 



the V3 loop, which is known to be an important binding region on the 
gpl20 envelope protein 19,23 . Because the use of phenotypic assays is still 
limited, the European Guidelines have encouraged the application of 
bioinformatics programs in coreceptor usage determination 27 . However, 
due to the intrinsic differences of each predictive system, divergent 
outputs are expected and remain a reason for concern in the wider 
application of this approach 9151 ''' 23 " 24 . 

The aim of this study was to examine the use of the Geno2pheno, 

•f r Icoreceptor] 

program for tropism prediction and determine its usefulness as an 
alternative method in clinical practice for screening CCR5 antagonist 
therapy candidates for the first time in Brazil. 

MATERIAL AND METHODS 

Sampling: Sample collection occurred between July 2009 and 
October 2010, from the HIV outpatient clinic ADEE3002/HCFMUSP. 
Only 99 from this cohort showed the minimum plasmatic RNA viral 
load necessary to qualify for the Trofile assay (> 1000 copies/mL, as 
recommended by Monogram Biosciences, San Francisco, CA). The 
written informed consent for collecting blood samples and the protocol 
for this study were approved by the Ethical Research Board of the 
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Hospital das Clinicas, Faculdade de Medicina da Universidade de Sao 
Paulo (Cappesq - HC/FMUSP, number 08/0108). 

Subjects: A total of 99 patients under clinical monitoring were 
included in this study, of whom 29 (29.3%) were females and 70 (70.7%) 
were males. With respect to antiretroviral therapy, 62 (62.6%) patients 
were drug-naive, and 37 (37.4%) were under HAART. Only subjects 
being treated with nucleoside and non-nucleoside reverse transcriptase 
inhibitors (NRTI and NNRTI) and protease inhibitors (PI) were studied. 

After signing the informed consent form, 15 mL of venous blood 
was collected in EDTA-treated tubes from each patient. Plasma samples 
were separated and stored at -70 °C until their shipment to Monogram 
Biosciences to perform the Trofile assay. Peripheral blood mononuclear 
cells (PBMCs) were isolated from blood samples by Ficoll-Hypaque 
density gradient centrifugation and stored at -70 °C until use. DNA was 
extracted from PBMCs using the GFX Genomic Blood DNA Purification 
Kit (GE Healthcare, Little Chalfont, UK) according to the manufacturer's 
instructions. 

The HIV- 1 plasmatic viral load was measured using the Versant HIV 
RNA 3.0 Assay (bDNA) (Siemens Healthcare Diagnostics, San Francisco, 
USA), and CD4+/CD8+ T cell counts were determined by flow cytometry. 
These data were collected during routine clinical monitoring of patients 
and were obtained from the laboratory database. 

In addition, 185 sequences containing known coreceptor sequence 
variations were obtained from the HIV Sequence Database (www.hiv- 
web.lanl.gov) to assess the application of genotypic tropism test (GTT) 
in identifying different HIV-1 variants. Only V3 sequences with 35 
amino acids were evaluated; sequences with 34 or 36 amino acids were 
excluded. All deposited sequences originating in Brazil were included, 
and at least 5% of the sequences for each group were selected from other 
countries of origin (Table 1). 



Table 1 

Distribution of HIV database sequences by subtype and phenotypic tropism 



Subtype 


R5 


R5X4 


X4 


Total 


B 


115 


18 


13 


146 


C 


23 


7 


5 


35 


Fl 


2 


1 


1 


4 


Total 


140 


26 


19 


185 



V3 region amplification and sequencing: The V3 region was 
amplified by nested polymerase chain reaction (PCR) using the previously 
described outer primers ED5/ED12 and inner primers ED31/ED33 4 . The 
PCR using the outer primers was performed with the following steps: 
one cycle of 94 °C for one min; 35 cycles of 94 °C for 45 sec, 55 °C 
for 45 sec, and 72 °C for two min; and a final extension cycle of 72 °C 
for 10 min. The PCR using the inner primers was performed with the 
following steps: one cycle of 94 °C for one min; 35 cycles of 94 °C for 
one min, 55 °C for one min and 72 °C for one min and 30 sec; and a 
final extension cycle of 72 °C for 10 min. The 50-uL reaction mixtures 
contained 2 mM MgCl,, 2 uM of each primer, 0.2 mM of each dNTP, 
1 U Taq DNA polymerase and 5 uL of extracted DNA for the outer PCR 
or 5 uL of PCR product for the inner PCR. 



The inner PCR products were purified using the QIAquick PCR 
fragment purification kit (Qiagen, Hilden, Germany) following the 
manufacturer's instructions. The sequencing reaction was carried out 
using the ABI Prism Big Dye Terminator v.3. 1 Cycle Sequencing Ready 
Reaction Kit (Applied Biosystems, Foster City, USA), adding 20-100 ng 
of amplified and purified product and each inner PCR primer at 1 uM. The 
reaction was carried out according to the manufacturer's instructions, and 
the sequencing was performed using the ABI Prism 3 100 system (Applied 
Biosystems, Foster City, USA). The resulting nucleotide sequences were 
edited using SeqScape Software v.2.6 (Applied Biosystems). 

HIV- 1 envelope subtyping was performed by manual phylogenetic 
analysis using the 2008 Alamos reference dataset (available at 
http://www.hiv.lanl.gov/content/sequence/NEWALIGN/align.html). 
Sequences were aligned and edited, and a tree was constructed with 
the Neighbor- Joining method and the Maximum Composite Likelihood 
model with 1000 bootstrap replicates using the MEGA v.5 package 25 . 

Tropism determination: Coreceptor usage was determined using 
the Geno2pheno, , bioinformatics tool (G2P) 10 . False positive rates 
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(FPR) were 10% for the proviral DNA sequences obtained in this study 
or 20% for clonal and clinical data. 

The Trofile Coreceptor Tropism Assay (Monogram Biosciences, 
San Francisco, USA) was performed on plasma samples from patients 
showing viral load > 1000 copies/mL. 

RESULTS 

Comparison between PTT and GTT: From the 99 samples sent for 
analysis with the Trofile assay, 55 (55.6%) were identified as R5 viruses, 
22 (22.2%) were R5X4, and only one (1%) sample showed tropism to 
the CXCR4 coreceptor. The Trofile assay was unable to generate results 
for 2 1 (2 1 .2%) samples. Therefore, the coreceptor prevalence within the 
assayable sample population was 70.5% (55/78) R5, 28.3% (22/78) R5X4 
and 1.3% (1/78) X4 strains. 

Readable sequences were obtained from 70 proviral DNA sequences, 
resulting in the generation of both Trofile and G2P results for 56 samples. 
These matched sample measures allowed for sensitivity and specificity 
testing, which showed that a FPR of 10% gave the highest sensitivity 
and specificity (Table 2). 



Table 2 

Sensitivity and specificity test for Geno2pheno [coreceptor] false positive rates in 
prediction of CXCR4 coreceptor usage 



G2P FPR 


Sensitivity (%) 


Specificity (%) 


10% 


54.5 


84.4 


20% clinical data 


50.0 


82.6 


20% clonal data 


34.8 


68.1 


Note: G2P FPR: Geno2pheno [( 


roreceptorj" 





Based on the G2P FPR of 10%, 55 (78.5%) and 15 (21.4%) of the 
70 sequences obtained in this study were predicted to be R5 and X4 
strains, respectively. There were no differences between the Trofile 
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results (R5 = 70.5%; X4 and R5X4 = 29.5%) and G2P results found 
for this population. 

Envelope subtyping was performed on the sequences obtained in this 
study. Only B (n = 46), C (n = 2) and Fl (n = 8) subtypes were found, 
and it was not possible to identify any recombinant forms. Tropism 
prediction showed good accuracy regardless of the subtype, generating 
concordances between the Trofile and G2P results of 76. 1 % for subtype 
B samples, 100% for subtype C and 87.5% for subtype Fl. Due to the 
small number of clinical samples in our study, a dataset containing 185 
sequences from the Los Alamos HIV-1 sequence database was also 
analyzed. The prediction accuracy for the samples in this dataset was 
represented by concordances between the Trofile and G2P results of 
88.3% for subtype B, 88.6% for subtype C and 100% for subtype Fl 
both sequences available in this database. 

DISCUSSION 

Phenotypic tropism tests (PTT) still represent the gold standard 
in tropism determination, but high cost and logistical restrictions 
are significant barriers preventing the wide application of these 
assays, particularly in developing countries 7 - 3 . Thus, the adoption of 
bioinformatics tools for determining coreceptor usage could provide a 
more accessible alternative for screening candidates for CCR5 antagonist 
therapy 1 ''- 21 . 

The phenotyping test has some limitations in the clinical setting. 
While the commercially available Trofile assay is the most widely used 
phenotypic test 1 - 7 and provides accurate determination of HIV tropism, 
it was unable to generate results for 2 1 % of the samples in this study. An 
additional important limitation of the Trofile assay is the requirement that 
samples have a minimum plasmaviral load of 1000 copies/mL. Based 
on studies that identified similar tropism behavior in 75% to 90% of 
sequences from plasma and PBMCs, using proviral DNA as the source 
of viral genetic material could be the most reliable option for obtaining 
V3 sequences. Moreover, X4 viruses were detected more frequently in 
DNA samples from PBMCs than in RNA samples from plasma 13 - 20 - 28 . In 
fact, the Trofile test is mostly used in clinical trials and, in the meantime, 
has been replaced in the clinical setting by genotyping tests (at least in 
Europe). Furthermore, the Trofile test is not an entirely phenotypic assay, 
as a library of V3 loop sequences of HIV-RNA from the test sample is 
prepared by PCR amplification. There is bias in this PCR amplification, 
as is in any PCR. Only the read out of the Trofile test is phenotypic. In 
contrast, the MT2 assay is entirely phenotypic. Bias is also present in 
population based genotypic testing by the PCRs and the sequencing 
reactions used. 

Bulk sequencing of proviral DNA is an intrinsically limited 
technique, because it produces a consensus sequence from dominant 
strains within the viral quasispecies 7 - 11 '. Although it has been suggested 
that this problem could be mitigated by increasing the number of 
replicate sequences produced from each sample, this modification also 
increases the method's costs and time required and could further limit its 
use, particularly in countries with restricted financial resources 7 -". The 
present study aimed to develop an accurate and cost-effective strategy 
for applying tropism testing in clinical practice. Although the European 
Guidelines recommend performing G2P with an FPR of 10% for triplicate 
samples and 20% for samples with a single sequence 27 , the present study's 



sensitivity and specificity tests showed that a G2P FPR = 10% is the best 
significance level, even for single sequences (Table 2). 

This study showed that the concordance between PPT and GTT 
was also high for different subtypes (mean of concordance = 88%), 
and this finding was confirmed by testing a large dataset from the Los 
Alamos Sequence Database (mean of concordance = 92%). These results 
corroborate previous studies that showed 80-90% agreement between 
phenotypic and genotypic tests for determining tropism 2 - 1617 - 26 . Despite 
the high concordance between the phenotypic and genotypic assays, 
there has been disagreement between the results generated by different 
bioinformatics-based assays, and several approaches to improving these 
predictions have been proposed 2 " 3 2122 26 . Additional studies are needed to 
determine if viral sequences in the Brazilian patient population might be 
so distinct as to invalidate their analysis with available bioinformatics 
tools, which derive their datasets mainly from sequences found in patients 
from the USA and European countries. 

Furthermore, while the determination of HIV-1 tropism has focused 
on the V3 loop, other features of gp 1 20 could influence virus affinity for 
a particular coreceptor, including N-glycosylation sites and variations in 
extensions of the V 1/V2 region 812 . Information on the three-dimensional 
structure of the V3 loop and clinical and laboratory data from patients, 
such as T cells counts and viral load, could also substantially improve 
tropism predictions 19 . 

CONCLUSIONS 

The application of bioinformatics tools in coreceptor usage 
determination has been widely studied and could provide an alternative 
approach in clinical practice for screening candidates for CCR5 
antagonist therapy, especially in cases where confirmatory assays for 
coreceptor usage determination are unavailable. This study showed that 
G2P analysis of sequences from PBMCs with an FPR of 10% could be 
the most suitable alternative approach for determining tropism, although 
this approach should be tested in a nationwide study to determine how 
HIV genetic diversity influences the findings. 

RESUMO 

Determinant) do tropismo viral por ensaios genotipicos e 
fenotipicos em pacientes brasileiros infectados por HIV-1 

A aplica§ao clmica dos antagonistas de CCR5 envolve em primeiro 
lugar determinar o uso de co-receptor pela cepa viral infectante. 
Programas de bioinformatica que preveem o uso co-receptor poderiam 
fornecer um metodo alternativo para selecionar candidates para o 
tratamento com os antagonistas do CCR5, particularmente em pafses 
com poucos recursos financeiros. Assim, o presente estudo teve por 
objetivo identificar a melhor abordagem utilizando ferramentas de 
bioinformatica para determinar qual o tipo de co-receptor do HIV-1 
que poderia ser usado na pratica clmica. Sequencias de DNA proviral 
e Trofile resultados a partir de 99 pacientes infectados pelo HIV-1 sob 
monitorizacao clmica foram avaliadas. Com base nos resultados do Teste 
Trofile, as variantes virais presentes eram R5 (81,1%), R5X4 (21,4%) 
e X4 (1,8%). Determinacao do tropismo pela analise do Geno2pheno, 
com taxa de falso positivos de 10% apresentou desempenho mais 
adequado para esta amostragem: as cepas R5 e X4 foram encontradas 
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em frequencias de 78,5% e 28,4%, respectivamente, e foi de 78,6% a 
concordancia entre os resultados fenotfpicos e genotfpicos. Mais estudos 
sao necessarios para esclarecer como a diversidade genetica entre as 
cepas do virus afeta abordagens baseadas na determinacao do tropismo 
pelas ferramentas de bioinformatica. Embora esta estrategia possa ser util 
para o rastreio de pacientes em paises em desenvolvimento, permanecem 
algumas limitacoes que restringem a aplicacao mais ampla para utiliza§ao 
de testes de co-receptor na pratica clmica. 
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